Cocktail Party Solutions: Mixing techniques
نویسنده
چکیده
The human auditory system has an unparalleled ability to solve what is known as the cocktail problem: the task of perceptually separating superimposed acoustic sources in a noisy environment. Information about the auditory system can inform our choices of representation and computation. Recent progress has been made from several diier-ent approaches to the acoustic source separation problem. Computational auditory scene analysis (CASA) and independent components analysis (ICA) represent two extremes in a spectrum deened by the dependence of the technique upon prior knowledge versus the richness of the input data. I suggest that some of these approaches can be combined and that the combination will require information about the intrinsic qualities of sources as well as about their relationships with each other. Framing these techniques in terms of probability will allow us to learn a network of modules based on intrinsic properties of natural sounds. Interaction between modules at diierent levels of analysis is necessary for negotiating a consistent grouping policy. This negotiation process requires representations that can accomodate ambiguity among potential grouping hypotheses. These grouping hypotheses should be continuously variable to allow for negotiation between different constraints. Negotiating between source-separation possibilities requires complexity to be minimized using regularization techniques.
منابع مشابه
Approaching the Time Dependent Cocktail Party Problem with Online Sparse Coding Neural Gas
We show how the “Online Sparse Coding Neural Gas” algorithm can be applied to a more realistic model of the “Cocktail Party Problem”. We consider a setting where more sources than observations are given and additive noise is present. Furthermore, we make the model even more realistic, by allowing the mixing matrix to change slowly over time. We also process the data in an online pattern-by-patt...
متن کاملImproved Cocktail - Party Processing
The human auditory system is able to focus on one speech signal and ignore other speech signals in an auditory scene where several conversations are taking place. This ability of the human auditory system is referred to as the “cocktail-party effect”. This property of human hearing is partly made possible by binaural listening. Interaural time differences (ITDs) and interaural level differences...
متن کاملAuto-associative Memory: The First Step in Solving Cocktail Party Problem
One of the most interesting and challenging problems in the area of Artificial Intelligence is solving the Cocktail Party problem. This is the task of attending to one speaker among several competing speakers and being able to switch the attention from one speaker to another at any given time. Human brain is remarkably efficient in solving this problem. There have been numerous attempts to emul...
متن کاملA Systematic Approach to Adaptive Algorithms for Multichannel System Identification, Inverse Modeling, and Blind Identification
In many situations related to acoustics and data communications we are confronted with multiple signals received from a multipath mixture, e.g., the famous cocktail-party problem. A multipath mixture can be described by a mixing matrix, whose elements are the individual transfer functions between a source and a sensor. The mixing matrix is usually unknown, and so are sometimes also the source s...
متن کاملUsing beamforming in the audio source separation problem
The problem of separating audio sources observed in a real room environment is a very challenging task, also known as the cocktail party problem. Much work has been presented on audio separation, even in cases of high reverb. However, various problems remain unsolved in a real-world scenario. In this paper, the authors review proposed solutions employing Independent Component analysis (ICA), di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999